BUG: read parquet files with older pytz (DEP: keep lower pytz minimum version) by jorisvandenbossche · Pull Request #65133 · pandas-dev/pandas

jorisvandenbossche · 2026-04-09T09:20:10Z

When we made pytz an optional dependency (#59089), we also bumped the minimum version (and later bumped it once more #62241). This causes issues with reading parquet files when someone does not have this required minimum version (the reported bug in #64978).

While we could solve this by improving the error message you get (so it is clear you have to update pytz), I also think there is not really a need to bump the minimum version here (pytz is mostly in maintenance mode AFAIK, and so the newer versions I assume are mostly updating the tz data)

For tzdata we actually decided to remove the minimum version altogether (#63335), but since pytz is still an API in addition to the tz data, I just kept the minimum version we had before in pandas 2.x (that should then at least not give problems for people upgrading from pandas 2 to 3 without upgrading pytz).

closes BUG: read_parquet fails with tz aware data #64978
Tests added and passed if fixing a bug or adding a new feature
- Not easy to test, but verified it locally with installing an older pytz version.
Added an entry in the latest doc/source/whatsnew/vX.X.X.rst file if fixing a bug or adding a new feature.

jorisvandenbossche · 2026-04-09T10:01:35Z

For the case you would still have an older pytz than 2020.1, I also want to improve the behaviour or error message. At the moment, we have many cases where we don't actually check exactly that we have a pytz object (using treat_tz_as_pytz without checking pytz can be imported).
That means some parts of our API do kind of work with an older version of pytz, although might give wrong results, eg:

>>> import pytz
>>> pd.Timestamp(2012, 1, 1).tz_localize("UTC").tz_convert(pytz.timezone("Europe/Brussels"))
Timestamp('2012-01-01 00:18:00+0018', tz='Europe/Brussels')
# should be Timestamp('2012-01-01 01:00:00+0100', tz='Europe/Brussels')

Although then some other parts raise a error (like what read_parquet ran into).

I could easily make read_parquet also "work" (but return those wrong values), but so it seems to be a better behaviour to actually raise a proper error message up front when trying to use a pytz timezone when your pytz version is too old.

That is what I added in the last commit (but since this is somewhat of a breaking change, I could also keep that for a separate PR for 3.1)

pandas/_libs/tslibs/timezones.pyx

jbrockmendel · 2026-04-09T20:57:32Z

Is it viable to write a test for this?

jorisvandenbossche · 2026-04-09T21:03:15Z

Not directly, since it is for the case where pytz is too old, and we don't have a CI build for that. I don't know if it would be possible to mock the pytz version in a test?

jbrockmendel · 2026-04-09T21:13:58Z

I don't know if it would be possible to mock the pytz version in a test?

I don't think so, especially if pyarrow is going to move to zoneinfo in the foreseeable future. Thanks for taking a look

pandas/compat/_optional.py

mroeschke · 2026-04-10T16:56:40Z

Thanks @jorisvandenbossche

lumberbot-app · 2026-04-10T16:56:48Z

Owee, I'm MrMeeseeks, Look at me.

There seem to be a conflict, please backport manually. Here are approximate instructions:

Checkout backport branch and update it.

git checkout 3.0.x
git pull

Cherry pick the first parent branch of the this PR on top of the older branch:

git cherry-pick -x -m1 398e59c04ac30f4930bdbcdb0208e93e71d5a25a

You will likely have some merge/cherry-pick conflict here, fix them and commit:

git commit -am 'Backport PR #65133: BUG: read parquet files with older pytz (DEP: keep lower pytz minimum version)'

Push to a named branch:

git push YOURFORK 3.0.x:auto-backport-of-pr-65133-on-3.0.x

Create a PR against branch 3.0.x, I would have named this PR:

"Backport PR #65133 on branch 3.0.x (BUG: read parquet files with older pytz (DEP: keep lower pytz minimum version))"

And apply the correct labels and milestones.

Congratulations — you did some good work! Hopefully your backport PR will be tested by the continuous integration and merged soon!

Remember to remove the Still Needs Manual Backport label once the PR gets merged.

If these instructions are inaccurate, feel free to suggest an improvement.

…-comparison * upstream/main: PERF: use lookup instead of hash_inner_join for merge with unique right keys (pandas-dev#64691) BUG : update `SeriesGroupBy.ohlc()` to honor `as_index=False` (pandas-dev#65141) PERF: Use DataFrame-level reductions in DataFrame.agg with list of funcs (pandas-dev#65031) DOC: document required external libraries in read_* I/O docstrings (pandas-dev#65143) DOC: improve MultiIndex.is_monotonic_increasing/decreasing docstrings (pandas-dev#65154) BUG: Raise ValueError for non-boolean numeric_only in DataFrame/Series reductions (GH#53098) (pandas-dev#65131) BUG: Timedelta.round() raises ZeroDivisionError when internal unit is 's' and target frequency is sub-second (pandas-dev#64836) ENH: Add replace method to Index (closes pandas-dev#19495) (pandas-dev#65099) PERF: improve StringArray.isna (pandas-dev#57733) BUG: read parquet files with older pytz (DEP: keep lower pytz minimum version) (pandas-dev#65133) DEPR: deprecate dates-with-datetime64 in _maybe_downcast_for_indexing (pandas-dev#64871) DOC: note that DataFrame.values is not writeable (pandas-dev#65142) CLN: Update groupby observed defaults (pandas-dev#65148) PERF: avoid materializing values[indexer] in Block.setitem (pandas-dev#64251) DOC: update GroupBy.sum/min/max See Also sections (pandas-dev#65144)

lower pytz dependency back to 2020.1 (pandas 2.3)

50cbebc

jorisvandenbossche added this to the 3.0.3 milestone Apr 9, 2026

jorisvandenbossche added the Bug label Apr 9, 2026

jorisvandenbossche requested a review from mroeschke as a code owner April 9, 2026 09:20

jorisvandenbossche added Timezones Timezone data dtype IO Parquet parquet, feather labels Apr 9, 2026

jorisvandenbossche changed the title ~~BUG: read parquet files with older pytz~~ BUG: read parquet files with older pytz (DEP: keep lower pytz minimum version) Apr 9, 2026

error on dtype creation when pytz timezone is passed and pytz is too old

dbe3e15

jorisvandenbossche added 2 commits April 9, 2026 14:38

add whatsnew

dc6176f

reword

19393f1

jorisvandenbossche requested a review from jbrockmendel April 9, 2026 13:06

jbrockmendel reviewed Apr 9, 2026

View reviewed changes

pandas/_libs/tslibs/timezones.pyx Outdated Show resolved Hide resolved

fixup

b3c68f8

mroeschke reviewed Apr 10, 2026

View reviewed changes

pandas/compat/_optional.py Show resolved Hide resolved

mroeschke approved these changes Apr 10, 2026

View reviewed changes

mroeschke merged commit 398e59c into pandas-dev:main Apr 10, 2026
45 checks passed

lumberbot-app bot added the Still Needs Manual Backport label Apr 10, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

BUG: read parquet files with older pytz (DEP: keep lower pytz minimum version)#65133

BUG: read parquet files with older pytz (DEP: keep lower pytz minimum version)#65133
mroeschke merged 5 commits intopandas-dev:mainfrom
jorisvandenbossche:old-pytz

jorisvandenbossche commented Apr 9, 2026 •

edited

Loading

Uh oh!

jorisvandenbossche commented Apr 9, 2026

Uh oh!

Uh oh!

jbrockmendel commented Apr 9, 2026

Uh oh!

jorisvandenbossche commented Apr 9, 2026

Uh oh!

jbrockmendel commented Apr 9, 2026

Uh oh!

Uh oh!

Uh oh!

mroeschke commented Apr 10, 2026

Uh oh!

lumberbot-app bot commented Apr 10, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

Conversation

jorisvandenbossche commented Apr 9, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jorisvandenbossche commented Apr 9, 2026

Uh oh!

Uh oh!

jbrockmendel commented Apr 9, 2026

Uh oh!

jorisvandenbossche commented Apr 9, 2026

Uh oh!

jbrockmendel commented Apr 9, 2026

Uh oh!

Uh oh!

Uh oh!

mroeschke commented Apr 10, 2026

Uh oh!

lumberbot-app bot commented Apr 10, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

jorisvandenbossche commented Apr 9, 2026 •

edited

Loading